FA4503A Sequence listing .txt 
SEQUENCE LISTING 

<110> National Institute of Advanced Industrial Science and Technology 
National Institute of Technology and Evaluation 
National Research Institute of Brewing 

<120> Pyroglutamyl peptidase and a gene thereof 

<130> A21773A 

<150> JP 2001-403261 

<151> 2001-12-27 

<160> 10 

<170> PatentIn version 3.1 

<210> 1 
<211> 2201 
<212> DNA 

<213> Aspergillus oryzae 
<220> 

<223> Inventor: Machida, Masayuki; Abe, Keietsu; Gomi, Katsuya; 
Inventor: Asai, Kiyoshi; Sano, Motoaki; Kin, Taishin 
Inventor: Nagasaki, Hideki; Hosoyama, Akira; Akita, Osamu 
Inventor: Ogasawara, Naotake; Kuhara, Satoru; Tokunaga, Chikara 
Inventor: Toda, Itaru; Saitoh, Chiaki; Senoh, Akihiro 

<220> 

<221> source 
<222> (1)..(2201) 

<223> /organism="Aspergillus oryzae" 
/strain="RIB 40" 

<220> 

<221> CDS 

<222> (1001)..(1111) 
<220> 

<221> CDS 

<222> (1197)..(1901) 
<220> 

<221> intron 
<222> (1112)..(1196) 
<223> /number=1 
<400> 1 

tcaagatctg aatgataaca gatttcctag tccattgtaa cacatcgatg gggttgacgt 60 
ggtaatagaa tcccaagact gatggaaatc gttctaggtg aagtcaaagt ctagatgatg 120 
aataaacaac cagactctag gaaaatgctg gtctagaccc ttgggcgaga aggaatgtgc 180 
tgataacgtc tcgctgcctt tcagcggtaa cgctaatcta aaagatcaac aaacaatcca 240 
ggagcaacca gagcaatcgg tgcgtgttca gtaagtgagc agtgagtgca caggagcact 300 
cacgtgctaa ccttacaaaa gcagcggcac ccatatcaaa caggaagaag tgggccgtac 360 
ggtagttcta ggatgacata ccgaaacccc ttatttgttc gcttaaatag atccctgccc 420 
agctttactg atggatttct aatcgcaaag taattgggtg aaataccatc ggtattaacc 480 
tagtgaatgg tgattctcaa ccatcgagta caagtcattc tcactattga actttccaaa 540 
aagccccgtg aacaagcagt ctgcggtttg ccccggctga agcaaggggg aaattgtcgg 600 
tcaggactcg gaacttcgga agcgaagcag aatcggcggt ggccaaaagg catgcgacgt 660 
gacagcacct cacatcattc cgggacaata acataggttc aattgcacaa ttgtctcaag 720 
aacatgggtg attgtcagat tgatacgtca atcaagcttt gtgggcggtc aagatgaggg 780 
gaggtcatgt gccttatcac cttatcgata tcgatatcgc gtgatgccaa gacctgcatg 840 
cggtggttgt aatgcggggg aagctccgtc gatatcttga atatatcttt agtccctcct 900 
ctctatcctt tttgtggcgt acatagctac cgtgtatata cgaagtaaag gcgttgttcc 960 
caccactgat tcctagcttg ccttgaccta tccactagcc atg gga gac ttt ggc 1015 

Met Gly Asp Phe Gly 
1 5 

cca ccg gtg cca ata ccc gag acg gag gta att ggt ctt gct tcg tca 1063 
Pro Pro Val Pro Ile Pro Glu Thr Glu Val Ile Gly Leu Ala Ser Ser 

10 15 20 

tct ttg aca gat cca gaa gag gtc tcg gta ctg gtg aca ggg ttc ggt 1111 
Ser Leu Thr Asp Pro Glu Glu Val Ser Val Leu Val Thr Gly Phe Gly 
25 30 35 

gtaagtttat ctgtcttatg cttggcttat gtttctttga ccgtcttggc ttctgattcg 1171 

gccctcaaat gctaaaatat actag cca ttc aag acc aac cta gtc aat gcc 1223 
Pro Phe Lys Thr Asn Leu Val Asn Ala 
40 45 

tcg tat ttg att gcc tca tct ctg cca gag tcg ctt gac ctt cct tcg 1271 
Ser Tyr Leu Ile Ala Ser Ser Leu Pro Glu Ser Leu Asp Leu Pro Ser 
50 55 60 

gcg aag ccg tct gga tcc ggg cct act tct cgt cgg att tca att cat 1319 
Ala Lys Pro Ser Gly Ser Gly Pro Thr Ser Arg Arg Ile Ser Ile His 
65 70 75 

gtc cat cca tcg ccc att ccc gtc gct tac tca aca gtg cgg aca act 1367 
Val His Pro Ser Pro Ile Pro Val Ala Tyr Ser Thr Val Arg Thr Thr 
80 85 90 

att cca acc atc cta gag gat tac gcc aag tcc cat gga ggt cga cga 1415 
Ile Pro Thr Ile Leu Glu Asp Tyr Ala Lys Ser His Gly Gly Arg Arg 
95 100 105 110 

cca gac att gta ctc cat atg gga ata gcg gct aca aga tcg tac tac 1463 
Pro Asp Ile Val Leu His Met Gly Ile Ala Ala Thr Arg Ser Tyr Tyr 
115 120 125 

tcg att gag acc aag gcg cat cga gat tct tac cac ttg tcc gat atc 1511 
Ser Ile Glu Thr Lys Ala His Arg Asp Ser Tyr His Leu Ser Asp Ile 
130 135 140 

aaa ggc aga atc ggt tat gaa gat ggg gag aag gtt tgg agg gag cag 1559 
Lys Gly Arg Ile Gly Tyr Glu Asp Gly Glu Lys Val Trp Arg Glu Gln 
145 150 155 

cag ctc ccg cca gta ctc cag gct ggt cct gcg gcg gat tcc aca gac 1607 
Gln Leu Pro Pro Val Leu Gln Ala Gly Pro Ala Ala Asp Ser Thr Asp 
160 165 170 

gta gta cgg aaa gtt ctc cac ccc cag ccg ccc aat gac gac ttt ctc 1655 
Val Val Arg Lys Val Leu His Pro Gln Pro Pro Asn Asp Asp Phe Leu 
175 180 185 190 

aac acg tgg aag tcg ttt gta tct cct gga gca gac gtc cgg ata tcc 1703 
Asn Thr Trp Lys Ser Phe Val Ser Pro Gly Ala Asp Val Arg Ile Ser 
195 200 205 

gag gac gct gga cgc tac ctc tgc gag ttc atc ttt tat aca agt ctg 1751 
Glu Asp Ala Gly Arg Tyr Leu Cys Glu Phe Ile Phe Tyr Thr Ser Leu 
210 215 220 

gcc cag gcg ttt caa caa ggc cag cac cga aac gtc gtt ttc ttc cat 1799 
Ala Gln Ala Phe Gln Gln Gly Gln His Arg Asn Val Val Phe Phe His 
225 230 235 

gtg cct gga tct tgc gcc gac gag gac atc gag aga ggc acg gat att 1847 
Val Pro Gly Ser Cys Ala Asp Glu Asp Ile Glu Arg Gly Thr Asp Ile 
240 245 250 

gca gct gga ttg atc aaa gct ctt gta aga tgt tgg gtt agc gag cag 1895 
Ala Ala Gly Leu Ile Lys Ala Leu Val Arg Cys Trp Val Ser Glu Gln 
255 260 265 270 

gta tag agcggcatgc aggttgctgg tatcgttttg caaagcaaga gcatgggcac 1951 
Val 

tggacgatat atatacttgc atttctatgg cgcggtgcac tatctgggtt ccggatgcgc 2011 
ttttagctgc agtcactcgt gatcatttat ttatagggga cttctgtccc cggtcttttc 2071 
aggttgagtt atacatgttt cacaggtttt ggatacacta tttaccctct gactactatc 2131 
gatgaatata gacagttgtc aagcatgata ttgggttcta ccgtattcgt atatgtgtag 2191 
aatttcccgt 2201 

<210> 2 
<211> 271 
<212> PRT 

<213> Aspergillus oryzae 
<400> 2 

Met Gly Asp Phe Gly Pro Pro Val Pro Ile Pro Glu Thr Glu Val Ile 
1 5 10 15 

Gly Leu Ala Ser Ser Ser Leu Thr Asp Pro Glu Glu Val Ser Val Leu 
20 25 30 

Val Thr Gly Phe Gly Pro Phe Lys Thr Asn Leu Val Asn Ala Ser Tyr 
35 40 45 

Leu Ile Ala Ser Ser Leu Pro Glu Ser Leu Asp Leu Pro Ser Ala Lys 
50 55 60 

Pro Ser Gly Ser Gly Pro Thr Ser Arg Arg Ile Ser Ile His Val His 
65 70 75 80 

Pro Ser Pro Ile Pro Val Ala Tyr Ser Thr Val Arg Thr Thr Ile Pro 
85 90 95 

Thr Ile Leu Glu Asp Tyr Ala Lys Ser His Gly Gly Arg Arg Pro Asp 
100 105 110 

Ile Val Leu His Met Gly Ile Ala Ala Thr Arg Ser Tyr Tyr Ser Ile 
115 120 125 

Glu Thr Lys Ala His Arg Asp Ser Tyr His Leu Ser Asp Ile Lys Gly 
130 135 140 

Arg Ile Gly Tyr Glu Asp Gly Glu Lys Val Trp Arg Glu Gln Gln Leu 
145 150 155 160 

Pro Pro Val Leu Gln Ala Gly Pro Ala Ala Asp Ser Thr Asp Val Val 
165 170 175 

Arg Lys Val Leu His Pro Gln Pro Pro Asn Asp Asp Phe Leu Asn Thr 
180 185 190 

Trp Lys Ser Phe Val Ser Pro Gly Ala Asp Val Arg Ile Ser Glu Asp 
195 200 205 

Ala Gly Arg Tyr Leu Cys Glu Phe Ile Phe Tyr Thr Ser Leu Ala Gln 
210 215 220 

Ala Phe Gln Gln Gly Gln His Arg Asn Val Val Phe Phe His Val Pro 
225 230 235 240 

Gly Ser Cys Ala Asp Glu Asp Ile Glu Arg Gly Thr Asp Ile Ala Ala 
245 250 255 

Gly Leu Ile Lys Ala Leu Val Arg Cys Trp Val Ser Glu Gln Val 
260 265 270 

<210> 3 
<211> 1000 
<212> DNA 

<213> Aspergillus oryzae 
<220> 

<221> source 
<222> (1)..(1000) 

<223> /organism="Aspergillus oryzae" 
/strain="RIB 40" 
<400> 3 

tcaagatctg aatgataaca gatttcctag tccattgtaa cacatcgatg gggttgacgt 60 
ggtaatagaa tcccaagact gatggaaatc gttctaggtg aagtcaaagt ctagatgatg 120 
aataaacaac cagactctag gaaaatgctg gtctagaccc ttgggcgaga aggaatgtgc 180 
tgataacgtc tcgctgcctt tcagcggtaa cgctaatcta aaagatcaac aaacaatcca 240 
ggagcaacca gagcaatcgg tgcgtgttca gtaagtgagc agtgagtgca caggagcact 300 
cacgtgctaa ccttacaaaa gcagcggcac ccatatcaaa caggaagaag tgggccgtac 360 
ggtagttcta ggatgacata ccgaaacccc ttatttgttc gcttaaatag atccctgccc 420 
agctttactg atggatttct aatcgcaaag taattgggtg aaataccatc ggtattaacc 480 
tagtgaatgg tgattctcaa ccatcgagta caagtcattc tcactattga actttccaaa 540 
aagccccgtg aacaagcagt ctgcggtttg ccccggctga agcaaggggg aaattgtcgg 600 
tcaggactcg gaacttcgga agcgaagcag aatcggcggt ggccaaaagg catgcgacgt 660 
gacagcacct cacatcattc cgggacaata acataggttc aattgcacaa ttgtctcaag 720 
aacatgggtg attgtcagat tgatacgtca atcaagcttt gtgggcggtc aagatgaggg 780 
gaggtcatgt gccttatcac cttatcgata tcgatatcgc gtgatgccaa gacctgcatg 840 
cggtggttgt aatgcggggg aagctccgtc gatatcttga atatatcttt agtccctcct 900 
ctctatcctt tttgtggcgt acatagctac cgtgtatata cgaagtaaag gcgttgttcc 960 
caccactgat tcctagcttg ccttgaccta tccactagcc 1000 

<210> 4 
<211> 300 
<212> DNA 

<213> Aspergillus oryzae 

<220> 

<221> source 
<222> (1)..(300) 

<223> /organism="Aspergillus oryzae" 
/strain="RIB 40" 
<400> 4 

agcggcatgc aggttgctgg tatcgttttg caaagcaaga gcatgggcac tggacgatat 60 
atatacttgc atttctatgg cgcggtgcac tatctgggtt ccggatgcgc ttttagctgc 120 
agtcactcgt gatcatttat ttatagggga cttctgtccc cggtcttttc aggttgagtt 180 
atacatgttt cacaggtttt ggatacacta tttaccctct gactactatc gatgaatata 240 
gacagttgtc aagcatgata ttgggttcta ccgtattcgt atatgtgtag aatttcccgt 300 

<210> 5 

<211> 816 

<212> DNA 

<213> Aspergillus oryzae 
<220> 

<221> source 

<222> (1)..(816) 

<223> /organism="Aspergillus oryzae" 
/strain="RIB 40" 

<220> 

<221> CDS 

<222> (1)..(816) 



<400> 5 

atg gga gac ttt ggc cca ccg gtg cca ata ccc gag acg gag gta att 48 
Met Gly Asp Phe Gly Pro Pro Val Pro Ile Pro Glu Thr Glu Val Ile 
1 5 10 15 

ggt ctt gct tcg tca tct ttg aca gat cca gaa gag gtc tcg gta ctg 96 
Gly Leu Ala Ser Ser Ser Leu Thr Asp Pro Glu Glu Val Ser Val Leu 
20 25 30 

gtg aca ggg ttc ggt cca ttc aag acc aac cta gtc aat gcc tcg tat 144 
Val Thr Gly Phe Gly Pro Phe Lys Thr Asn Leu Val Asn Ala Ser Tyr 
35 40 45 

ttg att gcc tca tct ctg cca gag tcg ctt gac ctt cct tcg gcg aag 192 
Leu Ile Ala Ser Ser Leu Pro Glu Ser Leu Asp Leu Pro Ser Ala Lys 
50 55 60 

ccg tct gga tcc ggg cct act tct cgt cgg att tca att cat gtc cat 240 
Pro Ser Gly Ser Gly Pro Thr Ser Arg Arg Ile Ser Ile His Val His 
65 70 75 80 

cca tcg ccc att ccc gtc gct tac tca aca gtg cgg aca act att cca 288 
Pro Ser Pro Ile Pro Val Ala Tyr Ser Thr Val Arg Thr Thr Ile Pro 
85 90 95 

acc atc cta gag gat tac gcc aag tcc cat gga ggt cga cga cca gac 336 
Thr Ile Leu Glu Asp Tyr Ala Lys Ser His Gly Gly Arg Arg Pro Asp 
100 105 110 

att gta ctc cat atg gga ata gcg gct aca aga tcg tac tac tcg att 384 
Ile Val Leu His Met Gly Ile Ala Ala Thr Arg Ser Tyr Tyr Ser Ile 
115 120 125 

gag acc aag gcg cat cga gat tct tac cac ttg tcc gat atc aaa ggc 432 
Glu Thr Lys Ala His Arg Asp Ser Tyr His Leu Ser Asp Ile Lys Gly 
130 135 140 

aga atc ggt tat gaa gat ggg gag aag gtt tgg agg gag cag cag ctc 480 
Arg Ile Gly Tyr Glu Asp Gly Glu Lys Val Trp Arg Glu Gln Gln Leu 
145 150 155 160 

ccg cca gta ctc cag gct ggt cct gcg gcg gat tcc aca gac gta gta 528 
Pro Pro Val Leu Gln Ala Gly Pro Ala Ala Asp Ser Thr Asp Val Val 
165 170 175 

cgg aaa gtt ctc cac ccc cag ccg ccc aat gac gac ttt ctc aac acg 576 
Arg Lys Val Leu His Pro Gln Pro Pro Asn Asp Asp Phe Leu Asn Thr 
180 185 190 

tgg aag tcg ttt gta tct cct gga gca gac gtc cgg ata tcc gag gac 624 
Trp Lys Ser Phe Val Ser Pro Gly Ala Asp Val Arg Ile Ser Glu Asp 
195 200 205 

gct gga cgc tac ctc tgc gag ttc atc ttt tat aca agt ctg gcc cag 672 
Ala Gly Arg Tyr Leu Cys Glu Phe Ile Phe Tyr Thr Ser Leu Ala Gln 
210 215 220 

gcg ttt caa caa ggc cag cac cga aac gtc gtt ttc ttc cat gtg cct 720 
Ala Phe Gln Gln Gly Gln His Arg Asn Val Val Phe Phe His Val Pro 
225 230 235 240 

gga tct tgc gcc gac gag gac atc gag aga ggc acg gat att gca gct 768 
Gly Ser Cys Ala Asp Glu Asp Ile Glu Arg Gly Thr Asp Ile Ala Ala 
245 250 255 

gga ttg atc aaa gct ctt gta aga tgt tgg gtt agc gag cag gta tag 816 
Gly Leu Ile Lys Ala Leu Val Arg Cys Trp Val Ser Glu Gln Val 
260 265 270 

<210> 6 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> adaptor 
<400> 6 

cgagagcggc cgctac 16 

<210> 7 

<211> 13 

<212> DNA 

<213> Artificial 

<220> 

<223> adaptor 

<400> 7 

gtagcggccg ctc 13 

<210> 8 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> primer for amplification of Aspergillus oryzae pyroglutamyl peptidase cDNA 
<400> 8 

gggctgcaga tgggagactt tggcccacc 29 

<210> 9 
<211> 30 
<212> DNA 
<213> Artificial 
<220> 

<223> primer for amplification of Aspergillus oryzae pyroglutamyl peptidase cDNA 
<400> 9 

ggggaattcc tatacctgct cgctaaccca 30 

<210> 10 
<211> 312 
<212> PRT 
<213> Artificial 
<220> 

<223> amino acid sequence wherein 41 amino acids containing polyhistidine-tag are added at N-terminal of amino acid sequence of SEQ ID NO:2 

<400> 10 

Met Arg Gly Ser His His His His His His Gly Met Ala Ser Met Thr 
1 5 10 15 

Gly Gly Gln Gln Met Gly Arg Asp Leu Tyr Asp Asp Asp Asp Lys Asp 
20 25 30 

Arg Trp Ile Arg Pro Arg Asp Leu Gln Met Gly Asp Phe Gly Pro Pro 
35 40 45 

Val Pro Ile Pro Glu Thr Glu Val Ile Gly Leu Ala Ser Ser Ser Leu 
50 55 60 

Thr Asp Pro Glu Glu Val Ser Val Leu Val Thr Gly Phe Gly Pro Phe 
65 70 75 80 

Lys Thr Asn Leu Val Asn Ala Ser Tyr Leu Ile Ala Ser Ser Leu Pro 
85 90 95 

Glu Ser Leu Asp Leu Pro Ser Ala Lys Pro Ser Gly Ser Gly Pro Thr 
100 105 110 

Ser Arg Arg Ile Ser Ile His Val His Pro Ser Pro Ile Pro Val Ala 
115 120 125 

Tyr Ser Thr Val Arg Thr Thr Ile Pro Thr Ile Leu Glu Asp Tyr Ala 
130 135 140 

Lys Ser His Gly Gly Arg Arg Pro Asp Ile Val Leu His Met Gly Ile 
145 150 155 160 

Ala Ala Thr Arg Ser Tyr Tyr Ser Ile Glu Thr Lys Ala His Arg Asp 
165 170 175 

Ser Tyr His Leu Ser Asp Ile Lys Gly Arg Ile Gly Tyr Glu Asp Gly 
180 185 190 

Glu Lys Val Trp Arg Glu Gln Gln Leu Pro Pro Val Leu Gln Ala Gly 
195 200 205 

Pro Ala Ala Asp Ser Thr Asp Val Val Arg Lys Val Leu His Pro Gln 
210 215 220 

Pro Pro Asn Asp Asp Phe Leu Asn Thr Trp Lys Ser Phe Val Ser Pro 
225 230 235 240 

Gly Ala Asp Val Arg Ile Ser Glu Asp Ala Gly Arg Tyr Leu Cys Glu 
245 250 255 

Phe Ile Phe Tyr Thr Ser Leu Ala Gln Ala Phe Gln Gln Gly Gln His 
260 265 270 

Arg Asn Val Val Phe Phe His Val Pro Gly Ser Cys Ala Asp Glu Asp 
275 280 285 

Ile Glu Arg Gly Thr Asp Ile Ala Ala Gly Leu Ile Lys Ala Leu Val 
290 295 300 

Arg Cys Trp Val Ser Glu Gln Val 
305 310 